59 research outputs found

    Diversity, mobility, and structural and functional evolution of group II introns carrying an unusual 3' extension

    Background: Group II introns are widespread genetic elements endowed with a dual functionality. They are catalytic RNAs (ribozymes) that are capable of self-splicing, and they are also mobile retroelements that can invade genomic DNA. The group II intron RNA secondary structure is typically made up of six domains. However, a number of unusual group II introns carrying a unique extension of 53-56 nucleotides at the 3' end have been identified previously in bacteria of the Bacillus cereus group.
    Methods: In the present study, we conducted combined sequence comparisons and phylogenetic analyses of introns, host gene, plasmid and chromosome of host strains in order to gain insights into mobility, dispersal, and evolution of the unusual introns and their extension. We also performed in vitro mutational and kinetic experiments to investigate possible functional features related to the extension.
    Results: We report the identification of novel copies of group II introns carrying a 3' extension, including the first two copies in bacteria not belonging to the B. cereus group: Bacillus pseudofirmus OF4 and Bacillus sp. 2_A_57_CT2, an uncharacterized species phylogenetically close to B. firmus. Interestingly, the B. pseudofirmus intron has a longer extension of 70 bases. From sequence comparisons and phylogenetic analyses, several possible separate events of mobility involving the atypical introns could be identified, including both retrohoming and retrotransposition events. In addition, identical extensions were found in introns that otherwise exhibit little sequence conservation in the rest of their structures, with the exception of the conserved and catalytically critical domains V and VI, suggesting either separate acquisition of the extra segment by different group II introns or a strong selection pressure acting on the extension. Furthermore, we show by in vitro splicing experiments that the 3' extension affects the splicing properties differently in introns belonging to separate evolutionary branches.
    Conclusions: Altogether, this study provides additional insights into the structural and functional evolution of unusual introns harboring a 3' extension and lends further evidence that these introns are mobile with their extension.

    The Chlamydomonas genome project: A decade on

    The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis, and micronutrient homeostasis. Ten years after its genome project was initiated, an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the omics era. Housed at Phytozome, the plant genomics portal of the Joint Genome Institute (JGI), the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of whole transcriptome sequencing (RNA-Seq) data. We present here the past, present, and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement; discuss resources for gene annotations, functional predictions, and locus ID mapping between versions; and, importantly, outline a standardized framework for naming genes.

    SuperCAT: a supertree database for combined and integrative multilocus sequence typing analysis of the Bacillus cereus group of bacteria (including B. cereus, B. anthracis and B. thuringiensis)

    The Bacillus cereus group of bacteria is an important group including mammalian and insect pathogens, such as B. anthracis, the anthrax bacterium; B. thuringiensis, used as a biological pesticide; and B. cereus, often involved in food poisoning incidents. To characterize the population structure and epidemiology of these bacteria, five separate multilocus sequence typing (MLST) schemes have been developed, which makes results difficult to compare. Therefore, we have developed a database that compiles and integrates MLST data from all five schemes for the B. cereus group, accessible at http://mlstoslo.uio.no/. Supertree techniques were used to combine the phylogenetic information from analysis of all schemes and datasets, in order to produce an integrated view of the B. cereus group population. The database currently contains strain information and sequence data for 1029 isolates and 26 housekeeping gene fragments, which can be searched by keywords, MLST scheme, or sequence similarity. Supertrees can be browsed according to various criteria such as species, isolate source, or genetic distance, and subtrees containing strains of interest can be extracted. Besides analysing the available data, users can enter their own sequences and compare them to the database and/or include them in the supertree reconstructions.

    Rapid Multi-Locus Sequence Typing Using Microfluidic Biochips

    sequencing of 6–8 housekeeping loci to assign unique sequence types. In this work we adapted MLST to a rapid microfluidics platform in order to enhance speed and reduce laboratory labor time. isolated in this study from one location in Rockville, Maryland (0.04 substitutions per site) was found to be as great as the global collection of isolates. Biogeographical investigation of pathogens is only one of a panoply of possible applications of microfluidics-based MLST; others include microbiologic forensics, biothreat identification, and rapid characterization of human clinical samples.

    Genotyping of Bacillus cereus Strains by Microarray-Based Resequencing

    The ability to distinguish microbial pathogens from closely related but nonpathogenic strains is key to understanding the population biology of these organisms. In this regard, Bacillus anthracis, the bacterium that causes inhalational anthrax, is of interest because it is closely related to, and often difficult to distinguish from, other members of the B. cereus group that can cause diverse diseases. We employed custom-designed resequencing arrays (RAs) based on the genome sequence of Bacillus anthracis to generate 422 kb of genomic sequence from a panel of 41 Bacillus cereus sensu lato strains. Here we show that RAs represent a “one reaction” genotyping technology with the ability to discriminate between highly similar B. anthracis isolates and more divergent strains of the B. cereus s.l. Clade 1. Our data show that RAs can be an efficient genotyping technology for pre-screening the genetic diversity of large strain collections to select the best candidates for whole genome sequencing.

    Tridimensional model structure and patterns of molecular evolution of Pepino mosaic virus TGBp3 protein

    Background: Pepino mosaic virus (PepMV) is considered one of the most dangerous pathogens infecting tomatoes worldwide. The virus is highly diverse and four distinct genotypes, as well as inter-strain recombinants, have already been described. The isolates display a wide range of symptoms on infected plant species, ranging from mild mosaic to severe necrosis. However, little is known about the mechanisms and pattern of PepMV molecular evolution and about the role of individual proteins in host-pathogen interactions.
    Methods: The nucleotide sequences of the triple gene block 3 (TGB3) from PepMV isolates varying in symptomatology and geographic origin have been analyzed. The modes and patterns of molecular evolution of the TGBp3 protein were investigated by evaluating the selective constraints to which particular amino acid residues have been subjected during the course of diversification. The tridimensional structure of the TGBp3 protein has been modeled de novo using the Rosetta algorithm. The correlation between symptom development and the location of specific amino acid residues was analyzed.
    Results: The results have shown that TGBp3 has been evolving mainly under the action of purifying selection operating on several amino acid sites, thus highlighting its functional role during PepMV infection. Interestingly, amino acid 67, which has been previously shown to be a necrosis determinant, was found to be under positive selection.
    Conclusions: Identification of diverse selection events in TGBp3 will help unravel its biological functions and is essential to an understanding of the evolutionary constraints exerted on the Potexvirus genome. The estimated tridimensional structure of TGBp3 will serve as a platform for further sequence, structural and functional analysis and will stimulate new experimental advances.

    Integrated Assessment of Genomic Correlates of Protein Evolutionary Rate

    Rates of evolution differ widely among proteins, but the causes and consequences of such differences remain under debate. With the advent of high-throughput functional genomics, it is now possible to rigorously assess the genomic correlates of protein evolutionary rate. However, dissecting the correlations among evolutionary rate and these genomic features remains a major challenge. Here, we use an integrated probabilistic modeling approach to study genomic correlates of protein evolutionary rate in Saccharomyces cerevisiae. We measure and rank degrees of association between (i) an approximate measure of protein evolutionary rate with high genome coverage, and (ii) a diverse list of protein properties (sequence, structural, functional, network, and phenotypic). We observe, among many statistically significant correlations, that slowly evolving proteins tend to be regulated by more transcription factors, deficient in predicted structural disorder, involved in characteristic biological functions (such as translation), biased in amino acid composition, and are generally more abundant, more essential, and enriched for interaction partners. Many of these results are in agreement with recent studies. In addition, we assess information contribution of different subsets of these protein properties in the task of predicting slowly evolving proteins. We employ a logistic regression model on binned data that is able to account for intercorrelation, non-linearity, and heterogeneity within features. Our model considers features both individually and in natural ensembles (“meta-features”) in order to assess joint information contribution and degree of contribution independence. Meta-features based on protein abundance and amino acid composition make strong, partially independent contributions to the task of predicting slowly evolving proteins; other meta-features make additional minor contributions. The combination of all meta-features yields predictions comparable to those based on paired species comparisons, and approaching the predictive limit of optimal lineage-insensitive features. Our integrated assessment framework can be readily extended to other correlational analyses at the genome scale.

    PPR proteins - orchestrators of organelle RNA metabolism.

    Pentatricopeptide repeat (PPR) proteins are important RNA regulators in chloroplasts and mitochondria, aiding in RNA editing, maturation, stabilisation or intron splicing, and in transcription and translation of organellar genes. In this review, we summarise all PPR proteins documented so far in plants and the green alga Chlamydomonas. By further analysis of the known target RNAs from Arabidopsis thaliana PPR proteins, we find that all organellar-encoded complexes are regulated by these proteins, although to differing extents. In particular, the orthologous complexes of NADH dehydrogenase (Complex I) in the mitochondria and NADH dehydrogenase-like (NDH) complex in the chloroplast were the most regulated, with 60% and 28%, respectively, of all characterised A. thaliana PPR proteins targeting their genes.

    The two tryptophans of β2-microglobulin have distinct roles in function and folding and might represent two independent responses to evolutionary pressure

    We have recently discovered that the two tryptophans of human β2-microglobulin have distinctive roles within the structure and function of the protein. Deeply buried in the core, Trp95 is essential for folding stability, whereas Trp60, which is solvent-exposed, plays a crucial role in promoting the binding of β2-microglobulin to the heavy chain of the class I major histocompatibility complex (MHCI). We have previously shown that the thermodynamic disadvantage of having Trp60 exposed on the surface is counter-balanced by the perfect fit between it and a cavity within the MHCI heavy chain that contributes significantly to the functional stabilization of the MHCI. Therefore, based on the peculiar differences of the two tryptophans, we have analysed the evolution of β2-microglobulin with respect to these residues

    Complete Bacteriophage Transfer in a Bacterial Endosymbiont (Wolbachia) Determined by Targeted Genome Capture

    Bacteriophage flux can cause the majority of genetic diversity in free-living bacteria. This tenet of bacterial genome evolution generally does not extend to obligate intracellular bacteria owing to their reduced contact with other microbes and a predominance of gene deletion over gene transfer. However, recent studies suggest intracellular coinfections in the same host can facilitate exchange of mobile elements between obligate intracellular bacteria—a means by which these bacteria can partially mitigate the reductive forces of the intracellular lifestyle. To test whether bacteriophages transfer as single genes or larger regions between coinfections, we sequenced the genome of the obligate intracellular Wolbachia strain wVitB from the parasitic wasp Nasonia vitripennis and compared it against the prophage sequences of the divergent wVitA coinfection. We applied, for the first time, a targeted sequence capture array to specifically trap the symbiont's DNA from a heterogeneous mixture of eukaryotic, bacterial, and viral DNA. The tiled array successfully captured the genome with 98.3% efficiency. Examination of the genome sequence revealed the largest transfer of bacteriophage and flanking genes (52.2 kb) to date between two obligate intracellular coinfections. The mobile element transfer occurred in the recent evolutionary past based on the 99.9% average nucleotide identity of the phage sequences between the two strains. In addition to discovering an evolutionary recent and large-scale horizontal phage transfer between coinfecting obligate intracellular bacteria, we demonstrate that “targeted genome capture” can enrich target DNA to alleviate the problem of isolating symbiotic microbes that are difficult to culture or purify from the conglomerate of organisms inside eukaryotes